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IDENTIFICATION OF BROADLY REACTIVE DR RESTRICTED EPITOPES 



CROSS-REFERENCE TO RELATED APPLICATIONS 

5 The present application is a continuation in part of USSN 60/036,713, filed 

January 23, 1997 and 60/037,432 filed February 7, 1997, both of which are incorporated 
herein by reference. 



BACKGROUND OF THE INVENTION 

10 Helper T lymphocytes (HTL) play several important functions in immunity 

to pathogens. Firstly, they provide help for induction of both CTL and antibody 

responses. By both direct contact and by secreting lymphokines such as IL2 and IL4, 

HTL promote and support the expansion and differentiation of T and B cell precursors into 

effector cells. In addition, HTL can also be effectors in their own right, an activity also 
15 mediated by direct cell contact and secretion of lymphokines, such as IFNy and TNFa. 

HTL have been shown to have direct effector activity in case of tumors, as well as viral, 

bacterial, parasitic, and fungal infections. 

HTL recognize a complex formed between Class II MHC molecules and 

antigenic peptides, usually between 10 and 20 residues long, and with an average size of 
20 between 13 and 16 amino acids. Peptide-Class II interactions have been analyzed in 

detail, both at the structural and functional level, and peptide motifs specific for various 

human and mouse Class II molecules have been proposed. 

In the last few years, epitope based vaccines have received considerable 

attention as a possible mean to develop novel prophylactic vaccines and immunotherapeutic 
25 strategies. Selection of appropriate T and B cell epitopes should allow to focus the 

immune system toward conserved epitopes of pathogens which are characterized by high 

sequence variability (such as HIV, HCV and Malaria). 

In addition, focusing the immune response towards selected determinants 

could be of value in the case of various chronic viral diseases and cancer, where T cells 
30 directed against the immunodominant epitopes might have been inactivated while T cells 

specific for subdominant epitopes might have escaped T cell tolerance. The use of epitope 
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based vaccines also allows to avoid "suppressive" T cell determinants which induce TH 2 
responses, in conditions where a THj response is desirable, or vice versa. 

Finally, epitope based vaccines also offer the opportunity to include in the 
vaccine construct epitopes that have been engineered to modulate their potency, either by 
5 increasing MHC binding affinity, or by alteration of its TCR contact residues, or both. 
Inclusion of completely synthetic non-natural or generically unrelated to the pathogen 
epitopes (such as TT derived "universal" epitopes), also represents a possible mean of 
modulating the HTL response toward a TH,, or TH 2 phenotype. 

Once appropriate epitope determinants have been defined, they can be 
10 assorted and delivered by various means, which include lipopeptides, viral delivery 
vectors, particles of viral or synthetic origin, naked or particle absorbed cDNA. 

However, before appropriate epitopes can be defined, one major obstacle 
has to be overcome, namely the very high degree of polymorphism of the MHC molecules 
expressed in the human population. In fact, more than two hundred different types of 
15 HLA Class I and Class II molecules have already been identified. It has been 

demonstrated that in the case of HLA Class I molecules, peptides capable of binding 
several different HLA Class I molecules can be identified. Over 60% of the known HLA 
Class I molecules can, in fact, be grouped in four broad HLA supertypes, characterized by 
similar peptide binding specificities (HLA supermotifs). 
20 In the case of Class III molecules, it is also known that peptides capable of 

binding multiple HLA types and of being immunogenic in the context of different HLA 
molecules do indeed exist. Until now, however, a general method for their identification 
has not been developed, probably at least in part a reflection of the fact that quantitative 
DR binding assays are labor intensive and that a large number of alleles must to be 
25 considered. 

The present invention addresses these and other needs. 
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SUMMARY OF THE INVENTION 

The present invention is based, at least in part, on the discovery and 
validation of specific motifs and assay systems for various DR molecules, representative of 
the worldwide population. Their application to the identification of broadly degenerate 
HLA Class II binding peptides is also described. 

Definitions 

The term "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other typically by peptide bonds between the alpha-amino and carbonyl groups 
of adjacent amino acids. The oligopeptides of the invention are less than about 50 residues 
in length and usually consist of between about 10 and about 30 residues, more usually 
between about 12 and 25, and often between about 15 and about 20 residues. 

An "immunogenic peptide" is a peptide which comprises an allele-specific 
motif such that the peptide will bind an MHC molecule and induce a HTL response. 
Immunogenic peptides of the invention are capable of binding to an appropriate HLA 
molecule and inducing HTL response against the antigen from which the immunogenic 
peptide is derived. 

A "conserved residue" is a conserved amino acid occupying a particular 
position in a peptide motif typically one where the MHC structure may provide a contact 
point with the immunogenic peptide. One to three, typically two, conserved residues 
within a peptide of defined length defines a motif for an immunogenic peptide. These 
residues are typically in close contact with the peptide binding groove, with their side 
chains buried in specific pockets of the groove itself. 

The term "motif" refers to the pattern of residues of defined length, usually 
between about 8 to about 11 amino acids, which is recognized by a particular MHC allele. 

The term "supermotif" refers to motifs that, when present in an 
immunogenic peptide, allow the peptide to bind more than one HLA antigen. The 
supermotif preferably is recognized by at least one HLA allele having a wide distribution 
in the human population, preferably recognized by at least two alleles, more preferably 
recognized by at least three alleles, and most preferably recognized by more than three 
alleles. 
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The phrases "isolated" or "biologically pure" refer to material which is 
substantially or essentially free from components which normally accompany it as found in 
its native state. Thus, the peptides of this invention do not contain materials normally 
associated with their in situ environment, e.g., MHC I molecules on antigen presenting 
cells. Even where a protein has been isolated to a homogenous or dominant band, there 
are trace contaminants in the range of 5-10% of native protein which co-purify with the 
desired protein. Isolated peptides of this invention do not contain such endogenous co- 
purified protein. 

The term "residue" refers to an amino acid or amino acid mimetic 
incorporated in an oligopeptide by an amide bond or amide bond mimetic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR4w4 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

Figure 2A presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR1 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

Figure 2B presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR7 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 



treating or diagnosing a number of pathological states such as viral, fungal, bacterial and 
parasitic diseases and cancers. In particular, it provides novel peptides capable of binding 
selected major histocompatibility complex (MHC) class II molecules and inducing an 
immune response. 

Peptide binding to MHC molecules is determined by the allelic type of the 
MHC molecule and the amino acid sequence of the peptide. MHC class I-binding peptides 
usually contain within their sequence two conserved ("anchor") residues that interact with 
corresponding binding pockets in the MHC molecule. Specific combination of anchor 



DESCRIPTION OF THE PREFERRED EMBODIMENT 



The present invention relates to compositions and methods for preventing, 
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5 



residues (usually referred to as "MHC motifs") required for binding by several allelic 
forms of human MHC (HLA, histocompatibility leukocyte antigens) are described in 
International Applications WO 94/03205 and WO 94/20127. Definition of specific MHC 
motifs allows one to predict from the amino acid sequence of an individual protein, which 
5 peptides have the potential of being immunogenic for CTL. These applications describe 
methods for preparation and use of immunogenic peptides in the treatment of disease. The 
peptides described here can also be used as helper T peptides in combination with peptides 
which induce a CTL response. This is described in WO 95/07077. 



10 them can be administered to mammals, particularly humans, for prophylactic and/or 

therapeutic purposes. The DR peptides can be used to enhance immune responses against 
other immunogens administered with the peptides, when the peptides of the invention are 
used as helper peptides. For instance, mixtures of peptides of the invention in 
combination with peptides that induce CTL responses may be used to treat and/or prevent 

15 viral infection and cancer. Alternatively, immunogens which induce antibody responses 
can be used. Examples of diseases which can be treated using the immunogenic mixtures 
of DR peptides and other immunogens include prostate cancer, hepatitis B, hepatitis C, 
AIDS, renal carcinoma, cervical carcinoma, lymphoma, CMV and condyloma 
acuminatum. 

20 The DR-binding peptides or nucleic acids encoding them may also be used 

to treat a variety of conditions involving unwanted T cell reactivity. Examples of diseases 
which can be treated using DR-binding peptides include autoimmune diseases (e.g., 
rheumatoid arthritis, multiple sclerosis, and myasthenia gravis), allograft rejection, 
allergies (e.g., pollen allergies), lyme disease, hepatitis, LCMV, poststreptococcal 

25 endocarditis, or glomerulonephritis, and food hypersensitivities. 

In therapeutic applications, the immunogenic compositions or the DR- 
binding peptides or nucleic acids of the invention are administered to an individual already 
suffering from cancer, autoimmune disease, or infected with the virus of interest. Those 
in the incubation phase or the acute phase of the disease may be treated with the DR- 

30 binding peptides or immunogenic conjugates separately or in conjunction with other 
treatments, as appropriate. 



The DR-binding peptides of the present invention or nucleic acids encoding 
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In therapeutic applications, compositions comprising immunogenic 
compositions are administered to a patient in an amount sufficient to elicit an effective 
immune response to the virus or tumor antigen and to cure or at least partially arrest 
symptoms and/or complications. Similarly, compositions comprising DR-binding peptides 
5 are administered in an amount sufficient to cure or at least partially arrest the symptoms of 
the disease and its complications. An amount adequate to accomplish this is defined as 
"therapeutically effective dose." Amounts effective for this use will depend on, e.g., the 
peptide composition, the manner of administration, the stage and severity of the disease 
being treated, the weight and general state of health of the patient, and the judgment of the 

10 prescribing physician. 

Therapeutically effective amounts of the immunogenic compositions of the 
present invention generally range for the initial immunization (that is for therapeutic or 
prophylactic administration) from about 1.0 fig to about 10,000 fig of peptide for a 70 kg 
patient, usually from about 100 to about 8000 fig, and preferably between about 200 and 

15 about 6000 fig. These doses are followed by boosting dosages of from about 1.0 fig to 

about 1000 fig of peptide pursuant to a boosting regimen over weeks to months depending 
upon the patients response and condition by measuring specific immunogenic activity in 
the patients blood. 

It must be kept in mind that the compositions of the present invention may 

20 generally be employed in serious disease states, that is, life-threatening or potentially life- 
threatening situations. In such cases, in view of the minimization of extraneous substances 
and the relative nontoxic nature of the conjugates, it is possible and may be felt desirable 
by the treating physician to administer substantial excesses of these compositions. 

For prophylactic use, administration should be given to risk groups. For 

25 example, protection against malaria, hepatitis, or AIDS may be accomplished by 

prophylactically administering compositions of the invention, thereby increasing immune 
capacity. Therapeutic administration may begin at the first sign of disease or the detection 
or surgical removal of tumors or shortly after diagnosis in the case of acute infection. 
This is followed by boosting doses until at least symptoms are substantially abated and for 

30 a period thereafter. In chronic infection, loading doses followed by boosting doses may be 
required. 
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Treatment of an infected individual with the compositions of the invention 
may hasten resolution of the infection in acutely infected individuals. For those 
individuals susceptible (or predisposed) to developing chronic infection the compositions 
are particularly useful in methods for preventing the evolution from acute to chronic 

5 infection. Where the susceptible individuals are identified prior to or during infection, for 
instance, as described herein, the composition can be targeted to them, minimizing need 
for administration to a larger population. 

The peptide mixtures or conjugates can also be used for the treatment of 
chronic infection and to stimulate the immune system to eliminate virus-infected cells in 

10 carriers. It is important to provide an amount of immuno-potentiating peptide in a 

formulation and mode of administration sufficient to effectively stimulate a cytotoxic T 
cell response. Thus, for treatment of chronic infection, a representative dose is in the 
range of about 1 .0 jig to about 5000 Mg, preferably about 5 /*g to 1000 ftg for a 70 kg 
patient per dose. Immunizing doses followed by boosting doses at established intervals, 

15 e.g. , from one to four weeks, may be required, possibly for a prolonged period of time to 
effectively immunize an individual. In the case of chronic infection, administration should 
continue until at least clinical symptoms or laboratory tests indicate that the viral infection 
has been eliminated or substantially abated and for a period thereafter. 

The pharmaceutical compositions for therapeutic or prophylactic treatment 

20 are intended for parenteral, topical, oral or local administration. Typically, the 
pharmaceutical compositions are administered parenterally, e.g., intravenously, 
subcutaneously, intradermally, or intramuscularly. Because of the ease of administration, 
the vaccine compositions of the invention are particularly suitable for oral administration. 
Thus, the invention provides compositions for parenteral administration which comprise a 

25 solution of the peptides or conjugates dissolved or suspended in an acceptable carrier, 
preferably an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, 
buffered water, 0.9% saline, 0.3% glycine, hyaluronic acid and the like. These 
compositions may be sterilized by conventional, well known sterilization techniques, or 
may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or 

30 lyophilized, the lyophilized preparation being combined with a sterile solution prior to 
administration. The compositions may contain pharmaceutical^ acceptable auxiliary 
substances as required to approximate physiological conditions, such as pH adjusting and 
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buffering agents, tonicity adjusting agents, wetting agents and the like, for example, 
sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, 
sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of DR and/or CTL stimulatory peptides of the invention 
5 in the pharmaceutical formulations can vary widely, i.e., from less than about 0. 1 %, 

usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be 
selected primarily by fluid volumes, viscosities, etc., in accordance with the particular 
mode of administration selected. 

The peptides and conjugates of the invention may also be administered via 

10 liposomes, which serve to target the conjugates to a particular tissue, such as lymphoid 
tissue, or targeted selectively to infected cells, as well as increase the half-life of the 
peptide composition. Liposomes include emulsions, foams, micelles, insoluble 
monolayers, liquid crystals, phospholipid dispersions, lamellar layers and the like. In 
these preparations the peptide to be delivered is incorporated as part of a liposome, alone 

15 or in conjunction with a molecule which binds to, e.g. , a receptor prevalent among 

lymphoid cells, such as monoclonal antibodies which bind to the CD45 antigen, or with 
other therapeutic or immunogenic compositions. Thus, liposomes filled with a desired 
peptide or conjugate of the invention can be directed to the site of lymphoid cells, where 
the liposomes then deliver the selected therapeutic/ immunogenic peptide compositions. 

20 Liposomes for use in the invention are formed from standard vesicle-forming lipids, which 
generally include neutral and negatively charged phospholipids and a sterol, such as 
cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome 
size, acid lability and stability of the liposomes in the blood stream. A variety of methods 
are available for preparing liposomes, as described in, e.g., Szoka, et aL, Ann. Rev. 

25 Biophys. Bioeng. 9, 467 (1980), U.S. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 
5,019,369, incorporated herein by reference. 

For targeting to the immune cells, a ligand to be incorporated into the 
liposome can include, e.g., antibodies or fragments thereof specific for cell surface 
determinants of the desired immune system cells. A liposome suspension containing a 

30 peptide or conjugate may be administered intravenously, locally, topically, etc. in a dose 
which varies according to, inter alia, the manner of administration, the conjugate being 
delivered, and the stage of the disease being treated. 
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Alternatively, DNA or RNA encoding one or more DR peptides and a 
polypeptide containing one or more CTL epitopes or antibody inducing epitopes may be 
introduced into patients to obtain an immune response to the polypeptides which the 
nucleic acid encodes. Wolff, et. al, Science 247: 1465-1468 (1990) describes the use of 
5 nucleic acids to produce expression of the genes which the nucleic acids encode. Such use 
is also disclosed in U.S. Patent Nos. 5,580,859 and 5,589,466. The nucleic acids can also 
be administered using ballistic delivery as described, for instance, in U.S. Patent No. 
5,204,253. Particles comprised solely of DNA can be administered. Alternatively, DNA 
can be adhered to particles, such as gold particles. The nucleci acids can also be delivered 

10 complexed to cationic compounds, such as cationic lipids. Lip id-mediated gene delivery 
methods are described, for instance, in WO 96/18372; WO 93/24640; Mannino and 
Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; WO 
91/06309; and Feigner et al (1987) Proc. Natl Acad. ScL USA 84: 7413-7414. The 
peptides of the invention can also be expressed by attenuated viral hosts, such as vaccinia 

15 or fowlpox. This approach involves the use of vaccinia virus as a vector to express 

nucleotide sequences that encode the peptides of the invention. Upon introduction into an 
acutely or chronically infected host or into a noninfected host, the recombinant vaccinia 
virus expresses the immunogenic peptide, and thereby elicits a host CTL response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 

20 Patent No. 4,722,848, incorporated herein by reference. Another vector is BCG (Bacille 
Calmette Guerin). BCG vectors are described in Stover et al. ( Nature 351:456-460 
(1991)) which is incorporated herein by reference. A wide variety of other vectors useful 
for therapeutic administration or immunization of the peptides of the invention, e.g., 
Salmonella typ hi vectors and the like, will be apparent to those skilled in the art from the 

25 description herein. 

A preferred means of administering nucleic acids encoding the peptides of 
the invention uses minigene constructs encoding multiple peptides of the invention along 
with CTL inducing peptides. To create a DNA sequence encoding the selected DR 
peptides and CTL epitopes for expression in human cells, the amino acid sequences of the 

30 epitopes are reverse translated. A human codon usage table is used to guide the codon 
choice for each amino acid. These epitope-encoding DNA sequences are directly 
adjoined, creating a continuous polypeptide sequence. To optimize expression and/or 



WO 98/32456 




PCT/US98/01373 



immunogenicity, additional elements can be incorporated into the minigene design. 
Examples of amino acid sequence that could be reverse translated and included in the 
minigene sequence include: DR peptides of the invention, a leader (signal) sequence, one 
or more CTL epitope, and an endoplasmic reticulum retention signal. In addition, MHC 
5 presentation of CTL epitopes may be improved by including synthetic (e.g. poly-alanine) 
or naturally-occurring flanking sequences adjacent to the CTL epitopes. 

The minigene sequence is converted to DNA by assembling oligonucleotides 
that encode the plus and minus strands of the minigene. Overlapping oligonucleotides (30- 
100 bases long) are synthesized, phosphorylated, purified and annealed under appropriate 

10 conditions using well known techniques, he ends of the oligonucleotides are joined using 
T4 DNA ligase. This synthetic minigene, encoding the CTL epitope polypeptide, can then 
cloned into a desired expression vector. 

Standard regulatory sequences well known to those of skill in the art are 
included in the vector to ensure expression in the target cells. Several vector elements are 

15 required: a promoter with a down-stream cloning site for minigene insertion; a 
polyadenylation signal for efficient transcription termination; an £. coli origin of 
replication; and an E. coli selectable marker (e.g. ampicillin or kanamycin resistance). 
Numerous promoters can be used for this purpose, e.g., the human cytomegalovirus 
(hCMV) promoter. See, U.S. Patent Nos. 5,580,859 and 5,589,466 for other suitable 

20 promoter sequences. 

Additional vector modifications may be desired to optimize minigene 
expression and immunogenicity. In some cases, introns are required for efficient gene 
expression, and one or more synthetic or naturally-occurring introns could be incorporated 
into the transcribed region of the minigene. The inclusion of mRNA stabilization 

25 sequences can also be considered for increasing minigene expression. It has recently been 
proposed that immunostimulatory sequences (ISSs or CpGs) play a role in the 
immunogenicity of DNA vaccines. These sequences could be included in the vector, 
outside the minigene coding sequence, if found to enhance immunogenicity. 

In some embodiments, a bicistronic expression vector, to allow production 

30 of the minigene-encoded epitopes and a second protein included to enhance or decrease 

immunogenicity can be used. Examples of proteins or polypeptides that could beneficially 
enhance the immune response if co-expressed include cytokines (e.g., IL2, IL12, GM- 
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CSF), cytokine-inducing molecules (e.g. LeIF) or costimulatory molecules. The HTL 
epitopes of the invention could be joined to intracellular targeting signals and expressed 
separately from the CTL epitopes. This would allow direction of the HTL epitopes to a 
cell compartment different than the CTL epitopes. If required, this could facilitate more 
5 efficient entry of HTL epitopes into the MHC class II pathway, thereby improving CTL 
induction. In contrast to CTL induction, specifically decreasing the immune response by 
co-expression of immunosuppressive molecules (e.g. TGF-P) may be beneficial in certain 
diseases. 



10 poly linker region downstream of the promoter. This plasmid is transformed into an 
appropriate E. coli strain, and DNA is prepared using standard techniques. The 
orientation and DNA sequence of the minigene, as well as all other elements included in 
the vector, are confirmed using restriction mapping and DNA sequence analysis. Bacterial 
cells harboring the correct plasmid can be stored as a master cell bank and a working cell 

15 bank. 



coli, followed by purification. Aliquots from the working cell bank are used to inoculate 
fermentation medium (such as Terrific Broth), and grown to saturation in shaker flasks or 
a bioreactor according to well known techniques. Plasmid DNA can be purified using 
20 standard bioseparation technologies such as solid phase anion-exchange resins supplied by 
Quiagen. If required, supercoiled DNA can be isolated from the open circular and linear 
forms using gel electrophoresis or other methods. 



formulations. The simplest of these is reconstitution of lyophilized DNA in sterile 
25 phosphate-buffer saline (PBS). A variety of methods have been described, and new 
techniques may become available. As noted above, nucleic acids are conveniently 
formulated with cationic lipids. In addition, glycolipids, fusogenic liposomes, peptides 
and compounds referred to collectively as protective, interactive, non-condensing (PINC) 
could also be complexed to purified plasmid DNA to influence variables such as stability, 
30 intramuscular dispersion, or trafficking to specific organs or cell types. 



Once an expression vector is selected, the minigene is cloned into the 



Therapeutic quantities of plasmid DNA are produced by fermentation in E. 



Purified plasmid DNA can be prepared for injection using a variety of 



Target cell sensitization can be used as a functional assay for expression and 
MHC class I presentation of minigene-encoded CTL epitopes. The plasmid DNA is 
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introduced into a mammalian cell line that is suitable as a target for standard CTL 
chromium release assays. The transfection method used will be dependent on the final 
formulation. Electroporation can be used for "naked" DNA, whereas cationic lipids allow 
direct in vitro transfection. A plasmid expressing green fluorescent protein (GFP) can be 
5 co-transfected to allow enrichment of transfected cells using fluorescence activated cell 
sorting (FACS). These cells are then chromium-51 labeled and used as target cells for 
epitope-specific CTL lines. Cytolysis, detected by 51Cr release, indicates production of 
MHC presentation of minigene-encoded CTL epitopes. 

In vivo immunogenicity is a second approach for functional testing of 

10 minigene DNA formulations. Transgenic mice expressing appropriate human MHC 

molecules are immunized with the DNA product. The dose and route of administration 
are formulation dependent (e.g. IM for DNA in PBS, IP for lipid-complexed DNA). 
Twenty-one days after immunization, splenocytes are harvested and restimulated for 1 
week in the presence of peptides encoding each epitope being tested. These effector cells 

15 (CTLs) are assayed for cytolysis of peptide-loaded, chromium-51 labeled target cells using 
standard techniques. Lysis of target cells sensitized by MHC loading of peptides 
corresponding to minigene-encoded epitopes demonstrates DNA vaccine function for in 
vivo induction of CTLs. 

For solid compositions, conventional nontoxic solid carriers may be used 

20 which include, for example, pharmaceutical grades of mannitol, lactose, starch, 

magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate, and the like. For oral administration, a pharmaceutical^ acceptable nontoxic 
composition is formed by incorporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 10-95% of active ingredient, that is, one or 

25 more conjugates of the invention, and more preferably at a concentration of 25%-75%. 

For aerosol administration, the peptides are preferably supplied in finely 
divided form along with a surfactant and propellant. Typical percentages of conjugates are 
0.01%-20% by weight, preferably 1%-10%. The surfactant must, of course, be nontoxic, 
and preferably soluble in the propellant. Representative of such agents are the esters or 

30 partial esters of fatty acids containing from 6 to 22 carbon atoms, such as caproic, 

octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an 
aliphatic polyhydric alcohol or its cyclic anhydride. Mixed esters, such as mixed or 
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natural glycerides may be employed. The surfactant may constitute 0. 1 % -20% by weight 
of the composition, preferably 0.25-5%. The balance of the composition is ordinarily 
propellant. A carrier can also be included, as desired, as with, e.g., lecithin for intranasal 
delivery. 



as an active ingredient an immunogenically effective amount of an immunogenic DR 
peptide or a CTL\DR peptide conjugate or nucleic acid encoding them as described herein. 
The conjugate(s) may be introduced into a host, including humans, linked to its own 
carrier or as a homopolymer or heteropolymer of active peptide units. Such a polymer has 

10 the advantage of increased immunological reaction and, where different peptides are used 
to make up the polymer, the additional ability to induce antibodies and/or CTLs that react 
with different antigenic determinants of the virus or tumor cells. Useful carriers are well 
known in the art, and include, e.g., thyroglobulin, albumins such as bovine serum 
albumin, tetanus toxoid, polyamino acids such as poly (lysine: glutamic acid), hepatitis B 

15 virus core protein, hepatitis B virus recombinant vaccine and the like. The vaccines can 
also contain a physiologically tolerable (acceptable) diluent such as water, phosphate 
buffered saline, or saline, and further typically include an adjuvant. Adjuvants such as 
incomplete Freund's adjuvant, aluminum phosphate, aluminum hydroxide, or alum are 
materials well known in the art. And, as mentioned above, CTL responses can be primed 

20 by conjugating peptides of the invention to lipids, such as P 3 CSS. Upon immunization 

with a peptide composition as described herein, via injection, aerosol, oral, transdermal or 
other route, the immune system of the host responds to the vaccine by producing large 
amounts of CTLs specific for the desired antigen, and the host becomes at least partially 
immune to later infection, or resistant to developing chronic infection. 

25 Vaccine compositions containing the DR peptides of the invention are 

administered to a patient susceptible to or otherwise at risk of disease, such as viral 
infection or cancer to elicit an immune response against the antigen and thus enhance the 
patient's own immune response capabilities, for instance with CTL epitopes described in 
**. Such an amount is defined to be an "immunogenically effective dose. " In this use, the 

30 precise amounts again depend on the patient's state of health and weight, the mode of 

administration, the nature of the formulation, etc., but generally range from about 1.0 fig 



5 



In another aspect the present invention is directed to vaccines which contain 
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to about 5000 per 70 kilogram patient, more commonly from about 10 /zg to about 500 
fig per 70 kg of body weight. 



invention with vaccines which induce neutralizing antibody responses to the virus of 
interest, particularly to viral envelope antigens. For instance, PADRE peptides can be 
combined with hepatitis vaccines to increase potency or broaden population coverage. 
Suitable hepatitis vaccines that can be used in this manner include, Recombivax HB® 
(Merck) and Engerix-B (Smith-Kline). 



also be expressed by attenuated viral hosts, such as vaccinia or fowlpox. This approach 
involves the use of vaccinia virus as a vector to express nucleotide sequences that encode 
the peptides of the invention. Upon introduction into an acutely or chronically infected 
host or into a non-infected host, the recombinant vaccinia virus expresses the immunogenic 
peptide, and thereby elicits a host CTL response. Vaccinia vectors and methods useful in 
immunization protocols are described in, e.g., U.S. Patent No. 4,722,848, incorporated 
herein by reference. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et al., Nature 351, 456-460 (1991)) which is incorporated herein by 
reference. A wide variety of other vectors useful for therapeutic administration or 
immunization of the peptides of the invention, e.g., Salmonella typhi vectors and the like, 
will be apparent to those skilled in the art from the description herein. 



resulting CTL can be used to treat chronic infections (viral or bacterial) or tumors in 
patients that do not respond to other conventional forms of therapy, or will not respond to 
a peptide vaccine approach of therapy. Ex vivo CTL responses to a particular pathogen 
(infectious agent or tumor antigen) are induced by incubating in tissue culture the patient's 
CTL precursor cells (CTLp) together with a source of antigen-presenting cells (APC) and 
the appropriate immunogenic peptide. After an appropriate incubation time (typically 1-4 
weeks), in which the CTLp are activated and mature and expand into effector CTL, the 
cells are infused back into the patient, where they will destroy their specific target cell (an 
infected cell or a tumor cell). 

The peptides of this invention may also be used to make monoclonal 
antibodies. Such antibodies may be useful as potential diagnostic or therapeutic agents. 



In some instances it may be desirable to combine the peptide vaccines of the 



For therapeutic or immunization purposes, the peptides of the invention can 



Antigenic conjugates may be used to elicit CTL ex vivo, as well. The 
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The peptides may also find use as diagnostic reagents. For example, a 
peptide of the invention may be used to determine the susceptibility of a particular 
individual to a treatment regimen which employs the peptide or related peptides, and thus 
may be helpful in modifying an existing treatment protocol or in determining a prognosis 
5 for an affected individual. In addition, the peptides may also be used to predict which 
individuals will be at substantial risk for developing chronic infection. 

Examples 

Materials and Methods 

Cells, The following Epstein-Barr virus (EBV) transformed homozygous cell lines 

10 were used as sources of human HLA Class II molecules: LG2 [DRBlcOlOl (DR1)1; 
GM3107 [DRB50101 (DR2w2a)]; MAT (DRB10301 (DR3)1; PREISS [DRB10401 
(DR4w4)l; BIN40 [DRB10404 (DR4wl4)l; SWEIG [DRB11101 (DR5wll)]; PITOUT 
[DRB10701 (DR7)] (a); KT3 [DRB10405 (DR4wl5)]; Herluf [DRB 11201 (DR5wl2)]; 
HO301 [DRB11302 (DR6wl9)]; OLL [DRB10802 (DR8w2)]; and HTC9074 [DRB10901 

15 (DR9), supplied as a kind gift by Dr. Paul Harris, Columbia University]. In some 

instances, transfected fibroblasts were used: L466.1 [DRB11501 (DR2w2b)]; TR81.19 
[DRB30101 (DR52a)]; and L257.6 [DRB40101 (DRw53)]. (Valli, et al. J. Clin. Invest. 
91:616 (1993). Cells were maintained in vitro by culture in RPMI 1640 medium 
supplemented with 2mM L-glutamine [GIBCO, Grand Island, NY], 50/xM 2-ME, and 

20 10% heat-inactivated FCS [Irvine Scientific, Santa Ana, CA]. Cells were also 
supplemented with 100 iiglvol of streptomycin and lOOU/ml of penicillin [Irvine 
Scientific]. Large quantities of cells were grown in spinner cultures. 

Cells were lysed at a concentration of 10 8 cells/ml in PBS containing 1 % 
NP-40 [Fluka Biochemika, Buchs, Switzerland], ImM PMSF [CalBioChem, La Jolla, 

25 CA], 5mM Na-ortho vanadate , and 25mM iodoacetamide [Sigma Chemical, St. Louis, 

Mo]. The lysates were cleared of debris and nuclei by centrifugation at 10,000 x g for 20 
min. 

Affinity purification of HLA-DR molecules. Class II molecules were purified by 
30 affinity chromatography as previously described (Sette, et al. J. Immunol. 142:35 (1989) 
and Gorga, et al. J. Biol. Chem. 262:16087 (1987)) using the mAb LB3.1 coupled to 
Sepharose 4B beads. Lysates were filtered through 0.8 and 0.4 filters and then passed 
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over the anti-DR column, which were then washed with 15-column volumes of lOmM 
TRIS in 1 % NP-40, PBS and 2-column volumes of PBS containing 0.4% 
n-octylglucoside. Finally, the DR was eluted with 50mM diethylamine in 0.15M NaCl 
containing 0.4% n-octylglucoside, pH 11.5. A 1/25 volume of 2.0M Tris, pH 6.8, was 
added to the eluate to reduce the pH to -8.0, and then concentrated by centrifugation in 
Centriprep 30 concentrators at 2000 rpm (Amicon, Beverly, MA). 

Class II peptide-binding assays. A panel of 13 different specific DR-peptide assays were 
utilized in the present study. These assays were chosen as to be representative of the most 
common DR alleles. Table I lists for each DR antigen, the representative allelic product 
utilized, the cell line utilized as a source of DR, and the radiolabled probe utilized in the 
assay. Purified human Class II molecules [5 to 500 nM] were incubated with various 
unlabeled peptide inhibitors and 1-10 nM 124 I-radiolabeled probe peptides for 48h in PBS 
containing 5% DMSO in the presence of a protease inhibitor cocktail. The radiolabeled 
probes used were HA Y307-319 (DR1), Tetanus Toxoid[TT] 830-843 (DR2w2a, 
DR5wlll, DR7, DR8w2, DR8w3, DR9), MBP Y85-100 (DR2w2b), TT1272-1284 
(DR52a), MT 65 kD Y3-13 with Y7 substituted with F for DR3, a non-natural peptide 
with the sequence YARFQSQTTLKQKT (DR4w4, DR4wl5, DRw53) (Valli, et al 
supra), and for DR5wl2, a naturally processed peptide eluted from the cell line C1R, 
EALIHQLINPYVLS (DR5wl2) and 650.22 peptide, (TT 830-843 A - S836 analog), for 
DR6wl9. 



Peptide inhibitors were typically tested at concentrations ranging from 1201 fig! 'ml to 1.2 
ng/ml. The data were then plotted and the dose yielding 50% inhibition (IC50) was 
measured. In appropriate stoichiometric conditions, the IC50 of an unlabeled test peptide 
to the purified DR is a reasonable approximation of the affinity of interaction (Kd). 
Peptides were tested in two to four completely independent experiments. The final 
concentrations of protease inhibitors were: ImM PMSF, 1.3nM 1.10 phenanthroline, 73 
/iM pepstatin A, 8mM EDTA, and 200 /iM N alpha-p-tosyl-L-lysine chloromethyl ketone 
(TLCK) [All protease inhibitors from CalBioChem, La Jolla, CA]. Final detergent 
concentration in the incubation mixture was 0.05% Nonidet P-40. Assays were performed 
at pH 7.0 with the exception of DR3, which was performed at pH 4.5, and DRw53, which 



Radiolabeled peptides were iodinated using the chloramine-T method. 
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was performed at pH 5.0. The pH was adjusted as previously described (Sette, et al /. 
Immunol 148:844 (1992)). 



filtration on TSK2000 columns (TosoHaas 16215, Montgomery ville, PA), and the fraction 
5 of bound peptide calculated as previously described (Sette, et al , (1989) supra). In 
preliminary experiments, the DR prep was titered in the presence of fixed amounts of 
radiolabeled peptides to determine the concentration of Class II molecules necessary to 
bind 10-20% of the total radioactivity. All subsequent inhibition and direct binding assays 
were the performed using these Class II concentrations. 

10 

DRB1 specificity of DR4wl5, DR6wl9, DR8w2, DR8w3, and DR9 assays. 



are not separated from P3 (and/or P4 and P5) molecules. Development and validation of 
assays in regard with DRp chain specificity has been described in detail elsewhere for 
15 many of the DR alleles listed above (108). Herein we describe for the first time DR4wl5, 
DR6wl9, DR8w2, DR8w3, and DR9 assays. Experiments addressing the p chain 
specificity of these new assays are described in the present section. 



determination of the specificity of the DR4wl5 binding assay is complicated in that the 
20 same radiolabeled ligand is used for both the DR4wl5 and DRw53 binding assays. Since 
typically pi chains are expressed at 5-10 fold higher levels than other p chains, and all 
binding assays are performed utilizing limiting DR amounts, it would be predicted that the 
dominant specificity detected in the assay would be DR4wl5. To verify that this was 
indeed the case, the binding pattern of a panel of 58 different synthetic peptides in the 
25 putative DR4wl5 specific assay with that obtained in a DRw53 specific assay (which uses 
a DRw53 fibroblast as the source of Class II molecules). Two very distinct binding 
patterns were noted, and in several instances, a peptide bound to one DR molecule with 
high affinity, and did not bind to the other (data not shown). 



30 the EBV transformed homozygous cell line H0301, which co-expresses DRB30301 

(DR52a). While the radiolabeled ligand used in the DR6wl9 assay is different than that 
used for the DR52a assay, the ligand is related (i.e., is a single substitution analog) to a 



Class II peptide complexes were separated from free peptide by gel 



Because the antibody used for purification is a-chain specific, pi molecules 



DR4wl5. The p4 product DRw53 is co-expressed with DR4wl5 and the 



DR6wl9. The DR6wl9 assay utilizes as the source of Class II molecules 
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high affinity DR52a binder. As was done in the case of DR4wl5, the specificity of the 
assay was investigated by analyzing the binding capacity of a panel of naturally occurring 
peptides for DR6wl9 and DR52a. The two assays demonstrated completely different 
binding specificities. For example, in terms of relative binding, TT 1272-1284 binds 
5 63-fold better in the DR52a assay than in the DR6wl9 assay. Conversely, the Invariant 
chain peptide binds 189-fold better in the DR6wl9 assay. In conclusion, these data 
demonstrated that the binding of the radiolabeled peptide 650.22 to purified Class II MHC 
from the H0301 cell line is specific for DR6wl9. 



have shown that the TT 830-843 radiolabeled probe peptide does not bind to DRw53 
molecules (Alexander, et aL, Immunity 1:751 (1994)). 

15 Results 

DR binding affinity of antigenic peptides recognized by DR restricted T cells 



significant, we compiled the affinities of a panel of 32 reported instances of DR restriction 
of a given T cell epitope. In approximately half of the cases, DR restriction was 

20 associated with affinities of less than 100 nM, and in the other half of the instances, with 
IC50% in the 100-1000 nM range. Only in 1 out of 32 cases (3. 1%) DR restriction was 
associated with IC50% of 1000 nM or greater. It was noted that this distribution of 
affinities differs from what was previously reported for HLA class I epitopes, where a vast 
majority of epitopes bound with IC50% of 50 nM or less (Sette, et al. , JI, 1994). This 

25 relatively lower affinity of class II restricted epitope interactions might explain why 

activation of class II restricted T cells in general requires more antigen relative to class I 
restricted T cells. 



affinity threshold associated with immunogenicity in the context of DR molecules, and for 
30 this reason a suitable target for our studies. 



10 



DR8w2 and DR8w3. The pi specificity of the DR8w2 and DR8w3 assays 
is obvious in that no P3 (and/or B4 and P5) molecule is expressed. 

DR9. The specificity of DR9 assay is inferred from previous studies which 



To define a threshold DR binding affinity, to be considered as biologically 



In conclusion, this analysis suggested that 1000 nM may be defined as an 



PI and P6 anchors are necessary but not sufficient for DRB10401 binding 



WO 98/32456 




PCT/US98/01373 



Several independent studies have pointed to a crucial role in DRB 10401 
binding of a large aromatic or hydrophobic residue in position 1 , near the N -terminus of 
the peptide and of a 9-residue core region (residues 1 through 9). In addition, an 
important role has been demonstrated for the residue in position six (P6) of this 9-residues 
5 core region. Short and/or hydrophobic residues were in general preferred in this position 
(O' Sullivan, etaL, JI 147:2663, 1991; Sette, etaL, JI 151:3163, 1993; Hammer, etai y 
Cell 74:197, 1993 and Marshall, et a/., JI 154:5927, 1995). 

In the present set of experiments, a library of 384 peptides was analyzed for 
DRB 10401 binding capacity and screened for the presence of the P1-P6 motif (that is, F, 

10 W, Y, L, I, V or M in PI and S, T, C, A, P, V, I, L or M in P6, at least 9 residues apart 
from the peptide C-terminus. This set of 384 peptides contained a total of 80 DR4w4 
binders (specifically 27 good binders [IC50 of 100 nM or less], and 53 intermediate 
binders [IC50 of the 100-1000 range]. Seventy-seven out of the 80 DR4w4 binders (96%) 
carried the P1-P6 motif. However, it should be noted that most non-DR4w4 binding 

15 peptides also contained the P1-P6 motif. Of 384 peptides included in our database, only 
125 were "P1-P6 negative." Only three of them (6%) bound appreciably to purified 
DR4w4 as opposed to 77/259 (30%) of the T1-P6 positive" peptides. Therefore, these 
results demonstrate that presence of suitable PI and P6 anchors are necessary but not 
sufficient for DRB 10401 binding. 

20 

A detailed map of DRB10401 peptide interactions 

Next, for each P1-P6 aligned core region, in analogy with what the strategy 
previously utilized to detail peptide class I interactions the average binding affinity of 
peptides carrying a particular residue, relative to the remainder of the group, were 

25 calculated for each position. Following this method a table of average relative binding 
(ARB) values was compiled. This table also represents a map of the positive or negative 
effect of each of the 20 naturally occurring amino acids on DRB10401 binding capacity 
when occupying a particular position, relative to the main P1-P6 anchors (Figure 1). 

Variations in ARB values greater than four fold (ARB * 4 or <; 0.25) were 

30 arbitrarily considered significant and indicative of secondary effects of a given residue on 
DR-peptide interactions. Most secondary effects were associated with positions 4, 7, and 
9. These positions correspond to secondary anchors engaging shallow pockets on the DR 
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molecule. In addition, significant secondary effects were detected for M in position 3 
(ARB = 12.8) T in position 3 (ARB = 4.34) and I in position 5 (ARB = 4.4). 

Development of a DRB10401 specific algorithm 

Next, the ARB table was utilized to develop a DRB 10401 specific 
algorithm. In order to predict 0401 binding propensity, each aligned P1-P6 sequence was 
scored by multiplying, for each position, the ARB value of the appropriate amino acid. 
According to this procedure, a numerical "algorithm score" was derived. If multiple 
P1-P6 alignments were possible, binding scores were calculated for each one and the best 
score was selected. The efficacy of this method in predicting 0401 binding capacity is 
shown in Table Ha. 



set of predicted peptides to 156. This set still contained 72 out of 80 (90%) of the total 
high or intermediate DR binders. Raising the cut-off to an algorithm score of -16.44 or 
higher still allowed identification of 60 out of 80 (75%) of the DR4w4 binding peptides. 
Of the whole 107 peptide set, twenty-five of them were either good or intermediate 
binders. In other words, as expected, increasing the algorithm score stringency predicted 
a smaller fraction of the total binders present in the set, but at the same time less false 
positive peptides were identified. 

Blind test of the predictive power of the DRB10401 specific algorithm 

To verify that the predictive capacity of our algorithm was not merely a 
reflection of having utilized the same data set to test and define the algorithm itself, we 
further examined its efficacy in a blind prediction test. For this scope we utilized data 
from an independent set of 50 peptides, whose binding affinities were known, but that had 
not been utilized in the derivation of the algorithm. As shown in Table lib, the algorithm 
was effective in predicting DR4w4 binding capacity of this independent peptide set. The 
algorithm score of -17.00 identified a total 18 peptides. This set contained 3/3 (100%) of 
all good binders, and 8/11 (70%) of all intermediate binders in the entire test set of 50 
peptides. Increasing the cut-off value to -16.44, identified a set of nine peptides. Seven 
of them (78%) were either good or intermediate binders. This set contained 7 out of 14 



Considering only peptides with algorithm scores above -17.00 narrowed the 
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(50%) of the binders contained in the blind prediction peptide set. In conclusion, these 
data supports the validity of the DR4w4 specific algorithm described above. 

Detailed maps of DRB10401, DRB10101, and DRB10701 peptide binding specificities 
5 Next, we analyzed the binding to purified DR1 and DR7 molecules for the 

same set of 384 peptides utilized to define the DR4w4 algorithm. It was found that this set 
contained 120 and 59 binders for the DR1 and DR7 alleles, respectively. A total of 158 
peptides were capable of binding either DR1, DR4w4 or DR7. A large fraction of them 
(73/158; 46%) were also degenerate binders, which bound two or more of the three alleles 
10 thus far considered. Furthermore, we also found that more than 90% of the DR1 or DR7 
good and intermediate binders carried the P1-P6 motif. Most importantly, 72 out of 73 
(99%) degenerate DR binders carried this motif (data not shown). In conclusion, this 
analysis suggests that P1-P6 based algorithms might be utilized to effectively predict 
degenerate DR binders. 

15 In analogy with what was described above for DR4w4 molecules, specific 

algorithms were designed for the DR1 and DR7 alleles. Figures 2A and 2B detail the 
allele specific maps defined according to this method. 

As in the case of DRB 10401, most secondary effects were concentrated in 
positions 4, 7 and 9. Position 4 was especially prominent in the case of DR1, while 

20 position 7 was the most prominent secondary anchor for DR7. Specific algorithms were 
developed based on these maps, and it was found that the cut-off values necessary to 
predict 75% or 90% of the binders were -19.32 and -20.28 for DR1, and 20.91 and 
-21.63 for DR7, respectively. Depending on the particular allele or cut off value selected, 
40 to 60% of the predicted peptides were in fact good or intermediate binders (data not 

25 shown). 

Development of a DR1-4-7 combined algorithm 

Finally, we examined whether a combined algorithm would allow to predict 
degenerate binders. For this purpose, the sequences of the 384 peptides in our database 
30 were simultaneously screened with the three (DR1, 4w4, and 7) specific algorithms. It 
was found that an even 100 peptides were predicted (using the 75% cut off) to bind either 
two or three of the alleles considered. This set contained 59 out of 73 (81 %) of the 
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peptides which were in fact capable of degenerate 1-4-7 binding (defined as the capacity to 
bind to more than one of the DR1, 4w4 or 7 alleles) (Table III). 

Definition of a target set of DR specificities, representative of the world population 

5 The data presented in the preceding sections illustrates how peptides capable 

of binding multiple DR alleles can be identified by the use of a combined " 1-4-7" - 
algorithm. Next, we wished to examine whether the peptides exhibiting degenerate 1-4-7 
binding behavior would also bind other common DR types as well. As a first step in our 
experimental strategy, we sought to define a set of target DR types representative of a 

10 large (;> 80%) fraction of the world population, irrespective of the ethnic population of 

origin. For this purpose, seven additional DR antigens were considered. For each one of 
the DR antigens considered in this study, (including DR1, 4 and 7), the estimated 
frequency in various ethnicities, according to the most recent HLA workshop (11th, 1991) 
is shown in Table IVa, together with the main subtypes thus far identified. 

15 For the purpose of measuring peptide binding affinity to the various DR 

molecules, one representative subtype for each DR antigen was chosen (Table I). It 
should be noted that for most antigens, either one subtype is by far the most abundant, or 
alternatively a significant degree of similarity in the binding pattern displayed by the 
different, most abundant subtypes of each DR antigen is likely to exist (see comments 

20 column of Table IVb). One exception to this general trend is represented by the DR4 

antigen, for which significant differences in peptide specificity between the 0401 and 0405 
have been reported. Since both alleles are quite frequent (in Caucasians and Orientals, 
respectively) we included both DR 0401 and 0405 in the set of representative DR binding 
assays. 

25 Our set of representative assays is mostly focused on allelic products of the 

gene, because these molecules appear to be the most abundantly expressed, serve as the 
dominant restricting element of most human class III responses analyzed thus far, and 
accurate methods for serologic and DNA typing most readily available. However, we 
have also considered in our analysis assays representative of DRB3/4/5 molecules (Table 

30 IVc). These molecules serve as a functional restriction element, and their peptide binding 
specificity has been previously shown to have certain similarities to the specificity of 
several common DRPj allelic products. 
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A general strategy for prediction of DR-degenerate binders. 

To test whether the 1-4-7 combined algorithm would also predict degenerate 
binding to other common DR types, we measured the capacity of three different groups of 
synthetic peptides to bind the panel of purified HLA DR molecules. The three different 
5 peptide sets were: A) 36 peptides which did not score positive in the combined 1-4-7 
algorithm (non-predictions), B) 36 peptides which did score positive for the 1-4-7 
algorithm, at the 75% cut off level, but had been found upon actual testing not to be 
degenerate 1-4-7 binders ("wrong" predictions), and C) 29 peptides which scored positive 
in the 1-4-7 algorithm, and also proved upon experimental testing, to be actual 1-4-7 
10 degenerate binders (correct predictions). The results of this analysis are shown in Table 
V. 

Within the set of "non-predictions" peptides (Table Va) only 3 out of 34 
(9%) bound at least two of the DR1, 4w4 or 7 molecules. Interestingly, 2 (1136.04 and 
1 136.29) out of 3 of these peptides were also rather crossreactive, and bound additional 

15 DR types (DR2w2 P2, DR4wl5, 5wl 1 and 8w2 in the case of 1 136.04, and 2w2 p2, 

4wl5, 9 and 5wl2 in the case of 1136.29). Peptides from the "wrong predictions" peptide 
set (Table V5), by definition bound at the most only one of the DR1, 4w4 or DR7 
molecules, and were also poorly degenerate or other DR types with 
only two peptides (1136.22 and 1188.35) binding a total of three DR molecules. Within 

20 this peptide set, no peptide bound four or more of the DR molecules tested (data not 
shown). 

These results are contrasted by data obtained with the peptide set 
corresponding to peptides which were first predicted by the use of the combined 1,4,7 
algorithm, and then experimentally found to be degenerate DR1-4-7 binding. Fourteen out 
25 of 29 peptides tested (48%) bound a total of five or more alleles. Four of them were 
remarkably degenerate (1188.16, 1188.32, 1188.34 and F107.09) and bound a total of 
nine out of the 11 DR molecules tested. In conclusion, these results suggest that a strategy 
based on the sequential use of a combined DR1, 4, 7 algorithm and quantitative DR1, 4, 7 
binding assays can be utilized to identify broadly crossreactive DR binding peptides. 

30 
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Definition of the HLA-DR 1-4-7 supertype 

The data presented above also suggested that several common DR types are 
characterized by largely overlapping peptide binding repertoires. When this issue was 
analyzed in more detail, by analyzing the binding pattern of the thirty-two peptides from 
5 Table Va and b which were actual DR1-4-7 degenerate binders. Thirty-one of them (97%) 
bound DR1, 22 (69%) DR4w4 and 21 (66%) DR7. These files are contrasted with the 
low percentages of binding observed amongst the remainder non-degenerate binding 
peptides (17/67 (25%), 8/67 (12%) and 7/67 (10%), for DR1, 4w4 and 7, respectively) 
(Table VII). 

10 Interestingly, a large fraction of the 1-4-7 degenerate binders also bound 

certain other common DR types. Sixteen (50%) bound DR2w2a, 18 (56%) DR6wl9, 18 
(56%) DR2w2b and 20 (62%) DR9. In all cases, the frequency of binding in the non-1-4- 
7 degenerate peptide set was much lower (Table VIII). 

Significant, albeit lower, frequencies of cross reactivity were noted also for 

15 DR4wl5, DR5wll, and DR8w2 (in the 28 to 37% range). Finally, negligible levels of 
cross reactivity were observed in the case of DR3 and 5wl2 and DR53. Further studies 
will address whether either of these two group of molecules (DR4wl5, 5wll, and 8w2 on 
one hand, and DR3, DR53 and 5wl2 on the other) might belong to different DR 
supertypes. 

20 In conclusion, these data demonstrates that a large set of DR molecules 

encompassing DR1, 4w4, 2w2a, 2w2b, 7, 9 and 6wl9 is characterized by largely 
overlapping peptide binding repertoires. 



Discussion 

25 In the present report we have analyzed the peptide binding specificity of a 

set of 13 different DR molecules, representative of DR types common among the 
worldwide population. Detailed maps of secondary anchors and secondary interactions 
have been derived for three of them (DR4w4, DR1 and DR7). Furthermore, we 
demonstrated that a set of at least seven different DR types share overlapping peptide 

30 binding repertoires; and consequently that broadly degenerate HLA DR binding peptides 
are a relatively common occurrence. This study also describes computerized procedures 
which should greatly assist in the task of identification of such degenerate peptides. 
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We would like to discuss the data in the context of our current 
understanding of peptide-class II interactions, as well as in the context of the recently 
described class I supermotifs. Finally, the potential implications of broadly degenerate 
class II epitopes for epitope based vaccine design should also be considered. 
5 Firstly, our studies illustrate how the vast majority of the peptides binding 

with good affinity to DR4w4, DR1, DR7 and most of the other DR types analyzed in the 
current study (data not shown), are all characterized by a P1-P6 motif consistent with the 
one originally proposed by O'Sullivan, et al. Crystallographic analysis of DRl-peptide 
complexes revealed that the residues occupying these positions engage two complementary 

10 pockets on the DR1 molecule, with the PI position corresponding to the most crucial 
anchor residue and the deepest hydrophobic pocket. Our analysis also illustrates how 
other "secondary anchor" positions drastically influence in an allele-specific manner 
peptide binding capacity. Position 4 was found to be particularly crucial for DR1 binding, 
position 9 for DR4w4, and position 7 for DR7. These data are consistent with previous 

15 results which originally described such allele-specific anchors, and with crystallographic 
data which illustrates how these residues engage shallow pockets on the DR molecule. 

Secondly, our studies illustrate how an approach based on alignment and 
calculation of average relative binding values of large peptide libraries allows definition of 
quantitative algorithms to predict binding capacity. The present study extends those 

20 observations to two other common HLA-DR types, and also illustrates how the combined 
use of the 1-4-7 algorithms can be of aid in identifying broadly degenerate DR binding 
peptides. 

The data presented herein suggest that a group of common DR alleles, 
including at least DR1, DR2w2a, DR2w2b, DR4w4, DR6wl9, DR7 and DR9 share a 

25 largely overlapping peptide repertoire. Degenerate peptide binding to multiple DR alleles, 
and recognition of the same epitope in the context of multiple DR types was originally 
described by Lanzavechia, Sinigallia's and Rothbard's groups. The present study provides 
a classification of alleles belonging to a main HLA-DR supertype (DRl-4-7-like) which 
includes DR1, DR2w2a, DR2w2b, DR4w4, DR7, DR9, DR6wl9. On the basis of the 

30 data presented herein, at least two additional groups of alleles exist. The first group 

encodes for molecules with significant, albeit much reduced overlap with the 1-4-7-like 
supertype (DR4wl5, 8w2, 5wll). The second group of alleles (5wl2, 3wl7, and w53) 
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clearly has little repertoire association with the 1-4-7 supertype. In this context it is 
interesting to note that Hammer, et al noted that good DR5wl 1 binding peptides are 
frequently characterized by positively charged P6 anchor (which would be poorly 
compatible) with the herein proposed 1-4-7 supermotif. It is also interesting to note that 
5 Sidney, et al proposed that DR3wl7 binds a set of peptides largely distinct from those 
bound by other common DR types. Future studies will have to determine whether any of 
the molecules listed above can be grouped in additional DR supertypes. Our group is 
currently investigating whether analysis of polymorphic residues lining the peptide binding 
pockets of DR can be utilized to aid in the classification and prediction of HLA DR 
10 supertypes. 

We would like to comment on similarities and differences between the HLA 
DR supertype described herein and the recently described HLA class I supermotif s. Class 
I supermotifs are clear-cut and, as a rule, non-overlapping. Four of them have been 
described all approximately equally frequent amongst the worldwide population. By 

15 contrast, the repertoire defining the HLA DR supertype herein described is not clear-cut 
and overlaps, at least in part, with the repertoire of other alleles. It also appears that on 
the basis of the data presented in Tables I and IV, even if other DR supertypes exist, the 
DR1-4-7 is going to be by far the most abundantiy represented worldwide. 

Finally, we would like to point out the possible relevance of these data in 

20 terms of development of epitope based vaccines. Class II restricted HTL have been 

implicated in protection from, and termination of many important diseases. Inclusion of 
well defined class II epitopes in prophylactic or therapeutic vaccines may allow to focus 
the immune response towards conserved or subdominant epitopes, and avoid suppressive 
determinants. Based on the data presented herein (Table IV), the DR 1-4-7 supertype 

25 would allow coverage in the 50 to 80% range, depending on the ethnicities considered. It 
is thus possible that broad and not ethnically biased population coverage could be achieved 
by considering a very limited number of peptide binding specificities. 

Based on the results present above, the sequences of various antigens of 
interest were scanned for the presence of the DR 1-4-7 motifs. Peptides identified using 

30 this approach are broadly cross reactive, class II restricted T cell epitopes. Table VIII 
presents a listing of such peptides derived from HBV, HCV, HIV and Plasmodium 
falciparum (Pf). A total of 146 peptides were identified: 35 from DHBV, 16 from HCV, 
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50 from HIV, and 45 from Pf. Standard conservancy criteria were employed in applicable 
cases. 

The above examples are provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary skill 
in the an. All publications, patents, and patent applications cited herein are hereby 
incorporated by reference for all purposes. 
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Table II 

An algorithm to predict DRB 1*0401 binding capacity, 
a) Original peptide set 





No. of peptides (Binding nM) 




Selection 
Criteria 


High 
£100 


Inter. 
100-1000 


Non 
>1000 


Total 


None 


27 


53 


304 


384 


P1-P6 


27 


50 


182 


259 


-17.00 " 


27 


45 


84 


156 


-16.44 ° 


25 


35 


47 


107 



1) Algorithm score which predicts 90% of all binders. 

2) Algorithm score which predicts 75% of all binders. 
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Table II 

b) Blind test of the predictive power of the DRB1*0401 algorithm. 

No. of peptides (Binding nM) 



Selection High Inter. Non 

Criteria $100 100-1000 >1000 Total 

None 3 11 36 50 

P1-P6 3 9 28 40 

-17.00 3 8 7 18 

-16.44 3 4 2 9 
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Table III 

A combined "1-4-7" algorithm. 



PCT/US98/01373 



Selection 
Criteria 



Degenerate 
Binders " 



Percent of TotaJ 
Degenerate Binders 



None 
P1-P6 



Combined Algorithms 
(90% Cutoff Value) 



73/384 
72/259 

67/147 



100% 
99% 

92% 



Combined Algorithms 
(75% Cutoff Value) 



59/100 



81% 



1) Degenerate binders are defined as peptides binding at least two out of the three 
DR1, 4w4, and 7 molecules with an IC50 of 1 nM or less. 
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Table IV 



Phenotypic frequencies of 10 prevalent HLA-DR antige: 



Anrigen 


Alleles 


Cauc. 


Phenotypic Frequencies 
Blk. Jpn. Chn. Hisp. 


Avg 


DR1 


DRBl"0101-03 


1R 5 
100 


ft 4 


in 7 




10.1 


10.4 


DR2 


DRB1M501-O3 


10 0 


14.0 


30.9 


22.0 


15.0 


203 


DR3 




17.7 


193 


0.4 


73 


14.4 


11.9 


DR4 


DRBl'0401-12 


23.6 


6.1 


40.4 


21.9 


29.8 


24.4 


DR7 


DRBr0701-02 


26.2 


11.1 


1.0 


15.0 


16.6 


14.0 


DR8 


DRBl'0801-5 


5.5 


10.9 


25.0 


10.7 


233 


15.1 


DR9 


DRBrO9Oll,09012 


3.6 


4.7 


243 


19.9 


6.7 


11.9 


DR11 


DRBrilOl-05 


17.0 


18.0 


4.9 


19.4 


18.1 


153 


DR12 


DRB1M201-O2 


2^ 


53 


13.1 


17.6 


5.7 


8.9 


DR13 


DRB1M301-O6 


21.7 


163 


14.6 


12.2 


103 


15.1 


Total 




97.0 


83.9 


98.8 


95.5 


95.6 


94.7 
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Table VII 



Frequency of Binders 





1-4-7 


Non 1-4-7 


DRType 


Degenerate 


L/egcneraic 


Binders (%) 


Binders (%) 


1 


31/32 (97) 


17/67 (25) 


4w4 


22/32 (69) 


8/67 (12) 


7 


21/32 (66) 


7/67 (10) 


a 

7 


20/32 (62) 


2/67 (3D) 


6wl9 


18/32 (56) 


6/67 (8.9) 


2w2Gb 


18/32 (56) 


16/67 (24) 


2w2Ba 


16/32 (50) 


10/67 (15) 


4wl5 


12/32 (37) 


4/67 (6.0) 


8w2 


10/32 (31) 


3/67 (45) 


5wll 


9/32 (28) 


6/67 (8.9) 


5wl2 


3/32 (9.4) 


4/67 (6.0) 


3wl7 


1/32 (3.1) 


0/67 (0) 


w53 


2/16 (13) 


7/43 (16) 
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Table VIII 



Sequence 


Source 


lit 
Poe 


Conservancy 


Predicttd 
1-4-7 




Pf 1 KAP 


227 


100 


3 


ILS VFFL A LFFITFN 


i f Exn 


3 




3 


KSKYKLAT5VI A HI 


Pf EXPl 


71 




3 




Pf EXPl 


73 




3 


LGNVKYT VTVPT TTT ' 


Pf TRAP 


4 


100 


3 


lo v r r LA Lr r lir N K 


Pf EXPl 


4 




3 


T VMT T TTn_TT KI/^VTTV 


Pf LSAl 


13 




3 


IVlAJLo V ri"L AU*H1 


Pf EXPl 


1 




3 


iviivN LA *Lo VoorLrV 


Pf CSP 


2 


95 


3 


fN so i i»li ivi v LbrLr L 


Pf CSP 


417 


95 


3 


in v i Li vi vrulrrUL 


Pf TRAP 


6 


100 


3 


CtrvXTT VMT T ItXTTM 


Pf LSAl 


8 




3 




Pf EXPl 


6 




3 


YFTLVlMt ! rPWllsjFv " 


Pf LSAl 


10 




3 


YISFYFILVNI T 1PM 


Pf LSAl 


6 




3 


AGLLCNVSTVT T nn\f "™ 


Pi EXPl 


82 




2 


ANOLWILTDCIPni 


l'l IKAT 


153 


100 


2 


AYKFWPCAATPYAt~ 

*» * *v» ▼ ▼ i v#/% nil i Aw 


ri ikai j 


514 


80 


2 


DKELTMSNVKNVSQT 


nr r cai 


81 




2 


FN WNSSIGLIMVLS 


ft KJbT 


413 


100 


2 


FYFIL V NLLLFHl NG 


ri LbAl 


9 




2 


GLAYKFWPGAATFY 


I f I KAI 7 


512 


80 


2 


CRDVONNIVDEnCYR 


Tit TD A T* 

It I KAI 


25 


90 


2 


HILY1SFYFTLVNLL 

****** wi J i lb' 11 ^ r f i 


Pf i cai 
I r Lb A i 


3 




2 


HNWVNHAVPLAMKIJ 


Tit T"D A 11 

1 i I KAI' 


62 


80 


2 


IVRiFFDLFLVNGR 


Of to a r> 
1 I IKAr 


12 


100 


2 


KFVVPCAATPYACEP 


ill KAI 


516 


80 


2 


KSLLRNXCVSENrFL 


Pf I <5Al 


QO 

yo 




2 


KYLVTVFUFFDLFL 


Pf TRAP 1 


a 

0 


1U0 


2 


LAGLLGhA^STVLLGC 


Pf EXPl 


ttl 
01 




2 


LCNV5TVLLGGVGLV 


Pf PXPl 


AC 




2 


UFFDLFLVNCRDVQ 


Pf TRAP 


It 
19 


1UU 


2 


LWILTDCIPDSIQD 


Pf TRAP 


156 


100 


2 


QLWTLTDGIPDS1Q 


Pf TRAP 


155 


100 


2 


RCYYIPHQ5SLPQDN 


Pf LSAl 


1669 




2 


RHNWVNHAVPLAMKL 


Pf TRAP 


61 


80 


2 


RHPFKIGS5DPADNA 


Pf EXPl 


107 




2 


SSVFNWNS5IGUM 


Pf CSP 


410 


95 


2 


VFNWNSSICUMVL 


Pf CSP 


412 


95 


2 


VKNVICPFMKAVCVE 


Pf TRAP 


223 


100 


2 


VK YLV rVFLHTDLF 


Pf TRAP 


7 


100 


2 


VSTVLLGGVGLVLYN 


Pf EXPl 


88 




2 


WENVKNVICPFMKAV 


Pf TRAP 


220 


100 


2 


YKFWPGAATPYACE 


Pf TRAP 


515 


80 


2 
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Table VIII 



Sequence 5 


iource 


1st 
Pot 


Conservancy 


Predicted 


ENRWQVMIVWQVDRM I 


-I1V1 VIF 


2 


81 


3 


ERYLKDQQLLCIWCCS I 


-nvi ENV 


589 




3 


ESELVSQUEQUKK I 


^rvi pol 


696 


80 


3 


FRKYTAFTIPSINNE 1 


■nvi pol 


303 


93 


3 


GQMVHQA1SPRTLNA 1 


-nvi GAG 


172 


88 


3 


IPEWEFVNTPFLVKL ] 


-nvi pol 


593 


93 


3 


LPFWAKEIVASCDK ] 


POL 


770 


87 


3 


NREILKEPVHCVYYD ) 


ravi pol 


485 


87 


3 


PAIFQ55MTKILEPF 1 


hnvi pol 


336 


80 


3 


ITWAXEIVASCDKC ] 


KTV1 POL 


771 


87 


3 


QEQIGWMTN NPPIPV 1 


HTV1 GAG 


276 


81 


3 


QCQMVHQAISPRTLN 


HTV1 GAG 


171 


85 


3 


SPA1FQSSMTTCILEP 


HTV1 POL 


335 


80 


3 


TLNFPlSnHTVFVK 


HIV1 POL 


176 


100 


3 


VKNWMTETLLVQNAN 


HTV1 GAG 


348 


81 


3 


VPVWKEATTTLFCAS 


HTV1 ENV 


54 


81 


3 


WEFVNTPPLVKLWYQ 


HT/1 POL 


596 


93 


3 


WVKWEEKAE5PEVI 


HIV GAG 


187 ! 


33 


3 


YYGVPVWKEATTTLF 


HtVl ENV 


51 


83 


3 


ASDFNLPPWAKETV 


HTV1 POL 


765 


80 


2 


ASCYTEAEVIPAETG 


HTV1 POL 


822 


93 


2 


DFNLPPVVAKHVAS 


HTV1 POL 


767 


87 


2 


EAIIRILQQLLFIHF 


HTV1 VPR 


58 


82 


2 


EKVYLAVWPAHKCIG 


HIV1 POL 


711 


93 


2 


ETAYFLLKLACR WPV 


HTV POL 


838 


65 


2 


EVQLCIPHPACLKKK 


HIV1 POL 


268 


80 


2 


FWEVQLGIPHPACLK 


HTV1 POL 


266 


100 


2 


CCTLNFPISPIEIVP 


HT/1 POL 


174 


100 


2 


CKIYKRWIILGLNKI 


HIV1 GAG 


294 


85 


2 


GTVLVCPTPVNIIGR 


HTV1 POL 


153 


100 


2 


HKAICTVLVCTTPVN 


HT/1 POL 


149 


93 


2 


ICTVLVGPTPVNnC 


HIV POL 


152 


74 


2 


KRWULCLNKIVRMY 


HTV1 GAG 


298 


88 


2 


KVYLAWVPAHKQGG 


HTV POL 


712 


74 


2 


UCTTAVPWNASWSNK 


HTV1 ENV 


607 




2 


LLQLTVWC1KQLQAR 


HIV1 ENV 


731 


80 


2 


NFPISPIETVPVKLK 


HTV1 POL 


178 


100 


2 


PQCWKCSPA1FQSSM 


HIV1 POL 


329 


87 


2 


PVN D CRNLLTQIGC 


HTV1 POL 


161 


87 


2 


QHLLQLTVWGIKQLQ 


WVl ENV 


729 


80 


2 


QQHLLQLTVWQKQL 


HIV1 ENV 


728 


80 


2 


SPEVTPMFSALSEGA 


HP/1 GAG 


197 


88 


2 


TKELQKQITKIQNFR 


HIV POL 


952 


67 


2 


TVLVCPTPVNnCRN 


HIV1 POL 


154 


100 


2 


VEAIIRILQQLLFIH 


HT/1 VPR 


57 


62 


2 


V1PMP5ALSEGATPQ 


HIV1 GAG 


200 


88 


2 


VNIICRNLLTQIGCT 


HP/1 POL 


162 


87 


2 


WCCSGKUCTTAVPWN 


HT/1 ENV 


601 




2 


WT1LGLNKIVRMY5P 


HT/1 GAC 


300 


88 


2 


YKRWDLGLNKIVRM 


HIV1 GAG 


297 


88 


1 2 


FILVNLUFHINGKI 


Pf L5A1 


11 




3 
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Table VUI 



Sequence 


Source 


1st 
Pos 


Conservancy 


Predicted 

1^"/ 


AEDLNLGNLNVSIPW 


HBV POL 


38 


95 


3 


DLNLGNLNVSIPWTH 


HBV POL 


40 


95 


3 


GFFLLTRILTIPQSL 


HBV ENV 


181 


80 


3 


IFLHlXLCUnXV 


HBV ENV 


245 


80 


3 


NLNVSIPVVTHCVCNF 


HBV POL 


45 


9S 


3 


PFLLAQFTSAICSW 


HBV POL 


523 


95 


3 


RF5WL5LLVPFVQWF 


HBV ENV 


332 


100 


3 


SPFLLAQFTSAIC5V 


HBV POL 


522 


95 


3 ! 


SVRP5WLSLLVPFVQ 


HBV ENV 


330 


80 


3 


AF5YMDDVVLCAKSV 


HBV POL 


546 


90 


2 


AGFFLLTRILTIPQS 


HBV ENV 


180 


80 


2 


FVQWFVGLSTTVWLS 


HBV ENV 


342 


95 


2 


GAHLSLRCLPVCAE5 


HBV X 


50 


90 


2 


CT5FVYVPSALNPAD 


HBV POL 


774 


80 


2 


GVVVTRTPPA YRPPN A 


HBV NUC 


123 


95 


2 


HLSLRGLPVCAF5SA 


HBV X 


52 


90 


2 


IH-U-IULCUFLL 


HBV ENV 


244 


80 


2 


ILLLCUFLLVLLDY 


HBV ENV 


249 


95 


2 


IVCLLGFAAPFTQCC 


HBV POL 


636 


90 


2 


KFAVP NLQ5LTNLLS 


HBV POL 


406 


95 


2 


LAQFT5A ICSWRRA 


HBV POL 


526 


95 


2 


LCUFLLVLLDYQCM 


HBV ENV 


252 


95 


2 n 


LCQVFADAlKrGWGL 


HBV POL 


694 


95 


2 


LHLY5HPIILGFRK1 


HBV POL 


501 


60 


2 


LLCLIFLLVLLDYQC 


HBV ENV 


251 


95 


2 


LVLLDYQCMLPVCPL 


HBV ENV 


258 


90 


2 


LYTFVQYVTVCLSPTV 


HBV ENV 


'339 


95 


2 


PLPIHT AELLA ACF A 


HBV POL 


722 


80 


2 


QCCYPALMPLYAQQ 


HBV POL 


648 


95 


2 


RDLLDT AS A LYREAL 


HBV NUC 


28 


80 


2 


SFCVWIRTPPAYRPP 


HBV NUC 


121 


90 


2 


SWLSRKYT5FPWLL 


HBV POL 


750 


85 


2 


VCLLCFAAPFTQCGY 


HBV POL 


637 


95 


2 


VPNLQSLTNLLSSNL 


HBV POL 


409 


85 


2 


WPKFAVPNLQ5LTNL 


HBV POL 


404 


95 


2 


KVLVLNPSV A ATLGF 


HCV 


1255 


100 


3 


PTLWARMILMTHFF5 


HCV 


2870 


79 


3 


ADLMGYIPLVCAPLC 


HCV 


131 


79 


2 


AVQWMNRUAFASRG 


HCV 


1917 


100 


2 


DLEUT5CSSNVSVA 


HCV 


2812 


93 


2 


DLYLV I KHADVIPVR 


HCV 


1134 


79 


2 


EDLVNLLPAILSPCA 


HCV 


1882 


79 


2 


rl ILi ALolOUlxiLrl 


HCV 


684 


79 


2 


GARLWLATATPPGS 


HCV 


1345 


79 


2 


QQYLAGLSTLPCNP 


HCV 


1776 


100 


2 


GVNYATCNLPGCSFS 


HCV 


161 


79 


2 


IQYLAGLSTLPGNPA 


HCV 


1777 


100 


2 


LHCLSAFSLHSYSPG 


HCV 


2919 


79 


2 


VNLLPAILSPGALW 


HCV 


1885 


79 


2 


VQW MNRU AFASRCN 


HCV 


1918 


100 


2 


YK VL VLNP5 V AATLG 


HCV 


1 1254 


100 


2 
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« 

Class II Peptides 



Peptida A A Sequanca Sourca 



008.00 


1 6 


SALLSSDITASVNCAK 


HEL 81-96 


200.06 


1 6 


SALSEGA7PQDLNTML 


H!V gp25 41-56 


213.10 


16 


NKAUELFRKDIAAKYK 


Sp. W. myo. 132-147 


S06.01 


20 


NKALELFRKDIAAKYKELGY 


SW Myo 132-151 


506.03 


18 


ALELFRKDIAAKYKELGY 


Sp>. W myo. 134-151 


506.05 


1 6 


ELFRKDIAAKYKELGY 


Sp. W myo. 136-151 


570.01 


16 


MAJCT1 AYDEE ARRGLE 


Haat Shock Prot 


705.06 


20 


KVYLPRMKMEEKYNLTSVLM 


Ova 279-298 


71 7.04 


14 


YASFVKTTTLHKFT.NH2 


combinatorial; DR2 optimized 


857.02 


20 


PHHTAUKJAILCWQELMOA 


HBV cor* 50-89 


865.01 


1 5 


YKMKMVHAAHAKMKM 


OVA KM cor. retention 


F0S0.03 


20 


GFYTTGAVnaFQDYKmC 


PLP 91-110 


FOB9.01 


15 


QNILLSNAPLGPQFP 


Tyrosinase 66-70 


F098.03 


20 


AAYAAOGYKVLVLNPSVAAT 


HCV NS3 1 242-1261 


FO98.04 


20 


QYKVUVLNP3VAATLGFGAY 


HCV NS3 1248*1 267 


F008.05 


1 4 


GYKVLVLNPSVAAT 


HCV NS3 1248-1261 


F098.06 


1 9 


SYVVTNMGLKFRQLLWPHI 


HBV Cor« 87-105 


F098.10 


1 2 


GLXFROaWFH 


HBV Cora 94-105 


F134.04 


20 


TLHGP7PLLYRLGAVONST 


HCV NS4 L20 


F134.05 


20 


NF1 SGIQ YLAGLSTLPGNP A 


HCV NS4 151-170 


F134.08 


21 


GEGAVQWMNRUAFASHGNHV 


HCV NS4 293-313 (1914-1934) 


IA-p5 


1 7 


KPVSOMRMATPLLMRPM 


Mouse invariant chain 85-101 


Tf-28 p1 


24 


LPKPPKPVSKMRMATPLLMGALPM 


Human invariant chain 8O-103 


27.0279 


15 


EYLVSFGVWtRTPPA 


HBV NUC 117 


27.0280 


15 


GVW1RTPPAYRPPNA 


HBV NUC 123 


27.0281 


1 5 


RHYLHTLWKAGILYK 


HBV POL 145 


27.0283 


15 


VPNLQSLTNLLSSNL 


HBV POL 409 


27.0288 


15 


WVTVYYGVPVWKEAT 


HIV1 ENV 47 


27.0293 


1 5 


YYGVPWVKEaTTTLF 


HIV1 ENV 51 


27.0294 


15 


VPVWKEATTTLFCAS 


HIV1 ENV 54 


27.0295 


1 5 


LSGIVQOQNNLLRA1 


HIVl ENV 711 


27.0296 


1 S 


OQHL1QLTVVJGIKOL 


HIV1 ENV 728 


27.0297 


1 5 


QHLLOTVVVGIKaLO 


HIVl ENV 729 


27.029B 


15 


LLQLTVWGJKQLQAR 


HIV1 ENV 731 


27.0304 


15 


OGQMVHQAJSPRTLN 


HIV1 GAG 171 


27.0307 


15 


SPEVIPMFSALSEGA 


HIVl GAG 197 


27.0310 


1 5 


QEaGWMTNNPPlPV 


HIV1 GAG 276 


27.0311 


15 


GEJYKRW1ILGLNW 


HIVl GAG 294 


27.0312 


1 5 


YKRWIILGLNKfVRM 


HIVl GAG 297 


27.0313 


1 5 


KRWULGLNKIVRMY 


HIVl GAG 298 


27.0314 


1 5 


WULGLNWVRMYSP 


HIVl GAG 300 


27.0315 


1 5 


VKNWNTCTU.VONAN 


HIV1 GAG 348 


27.0322 


15 


G7VLVGPTPVNIIGR 


HIV1 POL 153 


27.0324 


1 5 


PVN1IGRNLLTQ1GC 


HIVl POL 161 


27.0326 


IS 


GRNULTOIGC7LNFP 


HIV1 POL 166 


27.0328 


1 5 


TLNFPISP1ETVPVK 


HIV1 POL 176 


27.0329 


15 


NFP1SP1E7VPVKLK 


HIV1 POL 178 


27.0341 


1 5 


FRKYTAFHPSINNE 


HIVl POL 303 


27.0344 


15 


SPAJFQSSMTKILEP 


HIV1 POL 335 


27.0345 


16 


PAIFQSSMTWLEPF 


HIVl POL 336 


27.0349 


15 


QKLVGKLNWASOfYA 


HIV1 POL 437 


27.0350 


15 


VGKLNWASQjYAGIK 


HIV1 POL 440 


27.0351 


15 


NRE1LKEPVHGVYYD 


HIV1 POL 485 


27.0353 


15 


1PEWEFVNTPPLVKL 


HIV1 POL 593 


27.0354 


15 


WEFVNTPPtVKLWYQ 


HIVl POL 596 


27.0360 


15 


EQUKKEKVYLAWVP 


HIVl POL 705 
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Class U Peptides 

Ptptid* A A Sequence Source 



27.0361 1 5 EKVYUAWVP AH KGIQ 

27.0364 1 5 HSISrWRAMASOFNIPP 

27.0370 1 5 ASGYIEAEVJPAETG 

27.0372 IS AEHLKTAVOMAVF1H 

27.0373 1 5 KTAVQMAVF1HNFKR 
27.0377 1 5 QKQTTWQISFRVYYR 
27.0379 15 KLLWW3EQAWK3DN 

27 .038 1 15 ENRWQ^V\AOVS3RM 

27.0382 1 5 VEAIIRILOQILRH 
27.0384 1 5 FNWNSS1GUMVLS 

27.0387 15 MNYYGKQENWYSLKK 

27.0388 1 5 MRKLAILSVSSFLFV 
27.0390 1 5 KISSIGUMVLSFLFL 

27.0392 15 SSVFNWNSSiOUM 

27.0393 1 5 MK1LSVFFLALFF1I 
27.0398 IS RLVNLUFHINQW 
27.0400 1 5 HILYISFYFILVNLL 

27.0402 1 5 LUFHINGKIIKNSE 

27.0403 15 LVNLLIFWINGK11K 
27.0406 15 NUUFHINGKIUCNS 
27.0408 1 5 OTNFKSLLRNLGVSE 
27.041 2 1 5 AYKFWPQAATPYAQ 
27.0415 1 5 ISfVKYLVJVFUFFDL 

27.0417 15 VKNMGPFMKAVCVE 

27 .041 8 1 5 WENVKNVIGPFMKAV 
1 1 86.04 1 5 CSWRRAFPHCLAFS 
1 1 86.06 1 5 FVOWFVGLSPTVWLS 
11 86.1 0 15 LAQFT5AICSWRRA 
1186.15 15 LVPFVQWFVGLSPTV 
1186.18 16 NLSWLSLDVSAAFYH 

1 1 86.25 1 5 SFGVW1RTPPAYRPP 

1 1 86.26 1 5 5PFLLAOFTSAICSV 

11 86.27 1 S SSNLSWLSLDVSAAF 
1188.01 15 DKELTMSNVKNVSQT 
1188.13 15 AGUjSNVSTVLLGGV 
11 88 . 1 6 15 KSKYKLATSVLAGLL 
1188.32 1 5 GLAYKFWPGAATPY 

1 1 88.34 1 5 HNWVNHAVPLAMKU 

1188.35 IS IGPFMKAVCVEVEKT 
1188.38 1 5 KYKJAGG1AGGLALL 

1 1 88.45 1 5 RHNWVNHAVPLAMKL 

F091 .15 16 IKORNMWQEVGKAMY 

F1 07.03 16 LQSLTNLLSSNLSWL 

F 107.04 15 PFOAOFTSAlCSW 

F 1 07.00 1 5 KYKLATSVLAGLLGN 

F 107.10 15 LAGLLGNVS7VLLGG 

F107.11 15 RHPFWGSSOPADNA 

F107.14 15 ANQLW1LTDGIPOS 

F 1 07. 1 7 16 KFWPGAATPYAGEP 

F 1 07.23 1 5 VFNWNSSIOUMVL 

35.0093 15 VGPLTVN EKRRLKU 

35.0096 1 5 EStt.WCFSOF5RGN 

35.0100 15 LCQVFADATFTGWGL 

35.0106 1 5 VWVATDALMTOYTG 

35.0107 15 TVOFSLOPTFTIETT 
35.0125 15 AETFYVDGAANRETX 



HIV1 POL 711 
HIVl POL 750 
HIV1 POL 822 
HIV1 POL 911 
HIV1 POL 915 
HIV1 POL 956 
HIV1 POL 982 
HIV1 V1F 2 
H1V1 VPR 57 
Pf CSP 413 
Pf CSP 53 
PI CSP 2 
Pf CSP 417 
Pf CSP 410 
Pf EXP1 1 
Pt LSA1 11 
Pf LSA1 3 
Pf LSA1 16 
Pf LSA1 13 
Pf LSA1 15 
Pf LSA1 94 
Pf SSP2 514 
Pf SSP2 6 
Pf SSP2 223 
Pf SSP2 220 
HBV POL 534 
HBV ENV 342 
HBV POL 526 
HBV ENV 339 
HBV POL 422 
HBV NUC 121 
HBV POL 522 
HBV POL 420 
Pf LSA1 81 
Pf EXP1 82 
Pf EXP1 71 
Pf SSP2 512 
Pf SSP2 62 
Pf SSP2 227 
Pf SSP2 494 
Pf SSP2 61 
HIV1 ENV 566 
HBV POL 412 
HBV POL 523 
Pf EXP1 73 
Pf EXP1 81 
Pf EXP1 107 
Pf SSP2 153 
Pf SSP2 516 
Pf CSP 412 
HBV POL 96 
HBV POL 387 
HBV POL 683 
HCV 1437 
HCV 1466 
HIV POL 619 



2 
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Class II Peptides 

P«pHd« A A . Sequonc* Source 



35.0127 

35.0131 

35.0133 

35.0135 

35.0171 

35.0172 

1280.02 

1280.03 

1280.04 

12B0.06 

1280.08 

1280.09 

1280.12 

1280.13 

1280. 16 

1280.16 

1280.21 

1280.22 

1280.23 

1280.25 

1283.02 

1283.10 

1283.11 

1283.12 

1283.13 

1283.14 

1233.16 

1283.17 

1283.20 

1283.21 

1283.22 

1283.24 

1283.26 

1283.30 

1283.31 

1283.33 

1283.34 

1283.36 

1283.37 

1283.44 

1283.SO 

1283.55 

1283.57 

1283.61 

1298.02 

1208.03 

1208.04 

1208.06 

1298.07 

1298.08 

1298.10 

1208.11 

1298.13 

1208.16 

F1 25.02 

F1 25.04 



1 5 EVNIVTDSQYALGII 

15 WAGJKQGFG1PYNPQ 

1 5 GAWODNSD1KWP 

1 5 YRKILRQRWORUO 

1 5 POSIODSLKESRKLN 

1 5 KCNLYADSAWENVKN 

1 5 K3TVLVGPTPVNWQ 

1 6 KVYLAWVPAHKQK3G 

1 5 TKELOKOfTKIONFR 

1 5 AGFFLITRIIT1POS 

1 5 GFFU.TRILT1PQSL 

1 5 QTSFVYVPSALNPAD 

1 5 IIFLFILLLCUFLL 

1 5 KFAVPNLQSLTNUS 

1 5 LHLYSHPIILGFRW 

15 aCUFlLVODYOQ 

1 S VGLLGFAAPFTOCGY 

1 5 FYF1LVNLUFHING 

1 5 KSLLRNLGVSEN1FL 

1 5 RGYY1PHQSSLPODN 

1 5 VYUPRRGPRLGVRA 

1 5 GHRMAWDMMMNWSPT 

15 CGFVYCFTPSPWVG 

1 5 VYCFTPSPWVOXTD 

1 5 GNWFGCTWMNSXGFT 

1 5 FTTLPALSTGUHLH 

1 5 SKGWRLLAPITAYAO 

1 5 DLYLVTRHADVIPVR 

1 5 AOGYKVLVLNPSVAA 

1 5 GYKVLVLNPSVAATL 

1 5 VLVLNPSVAATLGFG 

1 5 GARLWLATATPPGS 

1 5 DWWATDALMTGYT 

1 S FTGLTHIDAHFLSQT 

1 5 YLVAYQATVCARAQA 

15 LEWTSfTWVLVGGVL 

1 5 TWVLVGGVLAALAAY 

1 5 AKHMWNFlSGIOYtA 

1 5 IQYLAGLSTLPGNPA 

1 5 MNRUAFASRGNHVS 

1 5 SYTWTGALITPCAAE 

15 GSSYGFOfYSPOORVE 

1 S LEUTSCSSNVSVAH 

1 5 ASCLRKLGVPPUWW 

1 5 VGNFTGLYSSTVPVF 

1 5 TNFLLSLGIHINPNK 

1 5 KQCFRKIPVNRP1DW 

1 5 KQAFTFSPTYKAFIC 

1 5 AANWILRGTSFVYVP 

1 S PORVHFASPLHVAWR 

1 5 IRPWSTQLULNGSL 

1 5 RSELYKYKWWEPL 

1 5 DRFYKTLRAEQASQE 

1 5 KVILVAVHVASQYIE 
1 7 LVNLUFHINGKIIKNS 

1 6 RHNWVNHAVPLAMKU 



HIV POL 674 
HIV POL 874 
HIV POL 989 
HIV VPU 31 
Pf SSP2 165 
Pf SSP2 21 1 
HIV POL 152 
HIV POL 712 
HIV POL 052 
HBV ENV 180 
HBV ENV 181 
HBV POL 774 
HBV ENV244 
HBV POL 406 
HBV POL 501 
HBV ENV 251 
HBV POL 637 
Pf LSA1 0 
Pf LSA1 08 
Pf LSA1 1669 
HCV Core 34 
HCV El 315 
HCV NS1/E2 506 
HCV NS1/E2 509 
HCV NS1/E2 550 
HCV NS1/E2 684 
HCV NS3 1025 
HCV NS3 1134 
HCV NS3 1251 
HCV NS3 1253 
HCV NS3 1256 
HCV N53 1345 
HCV NS3 1436 
HCVNS3 1567 
HCV NS3 1591 
HCV N64 1658 
HCV NS4 1664 
HCV NS4 1767 
HCV NS4 1777 
HCV NS4 1921 
HCV NSS 2456 
HCV NS5 2641 
HCV NS5 2813 
HCV NS5 2930 
HBV POL 53 
HBV POL 568 
HBV POL 61 5 
HBV POL 661 
HBV POL 764 
HBV POL 824 
HIV1 ENV 333 
HIV1 ENV 637 
HIV1 GAG 333 
HIV1 POL 813 
Pf LSA1 13 
Pf SSP2 61 
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WHAT IS CLAIMED IS: 

1. A composition comprising an isolated peptide that induces a CTL 
response and a T helper peptide comprising a motif of about nine residues wherein the first 
position from the N terminus of the motif is Y, F, W, L, I, V, M and the sixth position 

5 from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 

2. The composition of claim 1, wherein the T helper peptide consists of 
between about 10 and about 24 residues. 

10 3. The composition of claim 1, wherein the T helper peptide is derived 

from a viral antigen. 

4. The composition of claim 3, wherein the viral antigen is from HIV, 
HBV, or HCV. 

15 

5. The composition of claim 1, wherein the T helper peptide is derived 
from a parasite. 

6. The composition of claim 5, wherein the antigen is Plasmodium 

20 falciparum. 

7. The composition of claim 1, wherein the peptide that induces a CTL 
response is linked to the T helper peptide. 



25 8. A method of inducing a CTL response in a patient, the method 

comprising contacting a cytotoxic T cell from the patient with an isolated peptide that 
induces a CTL response and a T helper peptide comprising a motif of about nine residues 
wherein the first position from the N terminus of the motif is Y, F, W, L, I, V, M and the 
sixth position from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 
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9. The method of claim 8, wherein the step of contacting is carried out 
by administering to the patient a pharmaceutical composition comprising the nucleic acid 
encoding the peptide that induces a CTL response and the T helper peptide. 

10. The method of claim 8, wherein the the peptide that induces a CTL 
response is linked to the T helper peptide. 



11. A composition comprising a peptide as shown in Table VIII. 

12. A method of inducing a helper T cell response in a patient, the 
method comprising contacting a helper T cell with a peptide of claim 1 1 . 

13. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising the peptide. 



14. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising a nucleic acid 
encoding the peptide. 
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Box I Observation* wh»r» certain datma wert found unsearchable (Continuation of item 1 of first sheet) 
This irtcrnaiiooal report has not been established m respect of certain claims under Article 17(2) (a) for the following reasons: 
1. (~[ Claims Nos.: 

1—1 because they relate to subject matter not required to be searched by this Authority, namely: 



2. I I Claims Noa.: 

— because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. Q Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Boa II Observations where unity of invention to lacking (Continuation of item 2 of first sheet) 
This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



1. I 1 As all required additional search feet were timely paid by the applicant, this international search report covers all searchable 

claims. 

2. Q As ail searchable claims oould be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. [™| As only some of the required additional search fees were timely paid by the applicant, this international search report coven 

only those claims for which fees were paid, specifically claims Noa.: 



^] No required additional search fern wen timely paid by the applicant. Consequently, this international search report is 
r es tri cted to the invention first mentioned in the claims; it is covered by claims Nos.: 
first species 



Remark on Pretest Q The additional search fees were accompanied by the applicant's protest. 

I | No protest saco tn panied the payment of additional search fees. 
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A. CLASSIFICATION OF SUBJECT MATTER: 

IPC (6): 

A61K 38/08, 38/10, 39/10,39/02, 39/12; C07K 7/00, 14/005, 14/20, 14/195, 14/725; C07H 21/04 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions ai followi: 



This application contains claims directed to more than one species of the generic invention. These species are deemed 
to lack Unity of Invention because they are not so linked as to form a single inventive concept under PCT Rule 13. 1. 
In order for more than one species to be searched, the appropriate additional search fees must be paid. The species are 
as follows: Each of the peptides claimed in claim 11 ( the peptides listed in Table VIII). 

The species listed above do not relate to a single inventive concept under PCT Rule 13.1 because, under PCT Rule 
13.2, the species lack the same or corresponding special technical features for the following reasons: Each of the 
peptides listed in Table VIII have unique amino acid sequences. Each of the peptides of Table VIII have distinct 
biochemical structure and properties, ie ability to bind to a particular HLA molecule or elicit peptide specific antibody. 
Accordingly* the peptides disclosed in Table VIII are not so linked by a special technical feature within the meaning of 
PCT Rule 13.2 so as to form a general inventive concept. 
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